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George Williams defined an evolutionary unit as hereditary information for which the selection 
bias between competing units dominates the informational decay caused by imperfect transmission. 
In this article, I extend Williams' approach to show that the ratio of selection bias to transmission 
bias provides a unifying framework for diverse biological problems. Specific examples include Hal- 
dane and Lande's mutation-selection balance, Eigen's error threshold and quasispecies, Van Valen's 
clade selection, Price's multilevel formulation of group selection, Szathmary and Demeter's evolu- 
tionary origin of primitive cells, Levin and Bull's short-sighted evolution of HIV virulence, Frank's 
timescale analysis of microbial metabolism, and Maynard Smith and Szathmary's major transi- 
tions in evolution. The insights from these diverse applications lead to a deeper understanding of 
kin selection, group selection, multilevel evolutionary analysis, and the philosophical problems of 
evolutionary units and individualitjRF] 



In evolutionary theory, a gene could be de- 
fined as any hereditary information for which 
there is a . . . selection bias equal to several or 
many times its rate of endogenous change [U 
p. 44]. 

INTRODUCTION 

Natural selection increases inherited information about 
environmental challenge. Against selection, imperfect 
transmission reduces inherited information. Many prob- 
lems in biology come down to understanding the relative 
balance between selection and imperfect transmission. 

A clear understanding of selection and transmission 
requires greater precision with regard to abstract notions 
such as inherited information. However, before heading 
off in pursuit of abstract theory, it pays to have some 
simple examples in mind. Those simple examples define 
the challenges for deeper theory. 

In this paper, I work through several examples that 
turn on the relative strength of selection and imperfect 
transmission: Haldane [5] and Lande's [3] balance be- 
tween selection and mutation, Eigen's [1] error threshold 
and quasispecies, Van Valen's [5] multilevel analysis of 
clade selection, Price's [5] multilevel analysis of group 
selection, Szathmary and Demeter's [7] stochastic cor- 
rector model of early cellular evolution, Levin and Bull's 
[8] short-sighted model of parasite evolution, Frank's [9] 
timescale model of microbial metabolism, and Maynard 
Smith and Szathmary's |10) major transitions in evolu- 
tion. 

Others have pointed out similarities between some of 
these examples [TDHH]. However, the broad unity with 
regard to selection and transmission is sometimes lost. In 
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addition, the key role of timescale, although often noted, 
has not always been linked to selection and transmission 
in a simple and general way. 

Williams' [L quote emphasizes timescale: the oppo- 
sition between selection bias and rate of endogenous 
change. An entity can be shaped by natural selection 
only to the extent that the informational gain by natural 
selection is not overwhelmed by the relative rate of infor- 
mational decay by imperfect transmission. The balance 
between selection and decay often turns on the relative 
timescales over which those forces operate. 



THE DECAY OF TRANSMISSION FIDELITY 

Many processes reduce the similarity between ancestor 
and descendant. In classical genetics, mutation changes 
the intrinsic quality of an allele during transmission. 
Mixing of alleles reduces transmission fidelity because of 
interactions with the changed combination of other al- 
leles. Internal selection changes the frequency of alleles 
within individuals, altering the similarity between ances- 
tor and descendant. 

Internal selection may occur within a pool of allele 
copies, in which certain alleles express traits that cause 
their frequency to increase against their neighbors [13] . 
For example, shortened mitochondrial genomes in certain 
yeast replicate faster than full genomes. The shortened 
genomes can rise in frequency within cells, even though 
they reduce individual-level fitness. In diploid Mendelian 
genetics, internal selection arises when traits increase al- 
lelic transmission to offspring to greater than the stan- 
dard Mendelian probability of one-half. 

Mutation or mixing of alleles may, in some cases, cause 
unbiased change during transmission. Unbiased change 
decays transmission fidelity, but does not affect the di- 
rection of evolution for the average value of traits. Unbi- 
ased change can increase the variation in traits by caus- 
ing random fluctuations in the characters expressed by 
descendants. Under stabilizing selection, the amount of 
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Box 1. Topics in the theory of natural selection 

This article is part of a series on natural selection. Al- 
though the theory of natural selection is simple, it remains 
endlessly contentious and difficult to apply. My goal is to 
make more accessible the concepts that are so important, yet 
either mostly unknown or widely misunderstood. I write in 
a nontechnical style, showing the key equations and results 
rather than providing full derivations or discussions of math- 
ematical problems. Boxes list technical issues and brief sum- 
maries of the literature. 



variation may be shaped by a balance between an in- 
crease caused by fluctuations in transmission and a de- 
crease caused by selection removing fluctuations from the 
favored value [3]. 

Biased mutation or internal selection causes a direc- 
tional change during transmission. When the directional 
change during transmission opposes selection between in- 
dividuals or groups, the balance between selection and 
transmission influences the average value of traits. 



SELECTION VERSUS TRANSMISSION 

Total evolutionary change can be partitioned into com- 
ponents of selection and transmission: 

Total change = Aselection + Atransmission, 

in which the symbol A means the change caused by the 
process of or the change in the quantity of depending 
on context. This partition of total change into selection 
and transmission is so important that it is worthwhile 
to express the partition with symbols. The symbolic 
form allows us to look at variations of the partition and 
the consequences for understanding evolutionary process 
(Box [2}. 

Total change can be expressed by the change in the av- 
erage value of some trait. Let Az be the change in the av- 
erage trait value. Do not be misled by the word average. 
We can consider the average of the squared deviations 
of a trait to measure the variance, or the average of the 
product of different characters to measure correlations, 
or the average frequency of an allele in the population, 
or any other expression leading to some quantity: Az is 
the change in whatever quantity we choose. We write 
total evolutionary change as wAz, where w is average 
fitness. Average fitness accounts for the total numbers 
of births and deaths, allowing us to express selection and 
transmission directly in proportion to total change (see 
Box[2|. 

Express the change caused by selection as AS and the 
change caused by transmission as At. Then the total 
change in symbols is 

wAz = AS + At. (1) 



Box 2. Price's selection and transmission 



The Price equation provides a useful separation between 
selection and transmission [6] 1141 115| . Much literature and 
misunderstanding descend from the Price equation. I will 
treat the topic fully in a later article. Here, I briefly summa- 
rize the essential concepts. My previous publications related 
to the Price equation provide further background [16H18] . 
Other key references lead into the broader literature |12l I19f - 

I used the Price equation as the basis for Eq. in the 
text. The Price equation may be written as 

wAz = Cov(™, z) + E(toAz). 

Comparing to Eq. Q, the selection bias is AS = Cov(u>,z). 
This simply says that the selection bias is the association 
between fitness and character value, where association is ex- 
pressed by the covariance. The transmission bias is At = 
E(iuAz). This says that the transmission bias is the average 
(expectation) of the change in character value, Az, between 
parent and offspring. The individual parent-offspring biases 
in transmission are weighted by parental fitness, w. If, for 
example, a parent reproduces little, then that parent's trans- 
mission bias contributes little to the average transmission bias 
in the population. 

The expression for selection in Eq. (|5| is derived as AS = 
Cov(w,z) = PwzV z , because the covariance of w and z is 
the product of the regression coefficient, ft of w on z, and 
the variance of z. Define s z = |/3™ z |, and apply a minus 
sign when /3 WZ < to obtain Eq. Q6D . See Frank [17] for the 
interpretation of these terms in the Price equation. 

In the mutation-selection balance models, either z = q is 
allele frequency or z is the squared deviation of a trait from 
the optimum. In either case, z is always positive, and the 
association between fitness and character value is negative. 
Thus, —s z — /3 mz , and we can express fitness in terms of the 
regression form 

E{w\z) = 1 + p wz z = 1 - s z z. (2) 

Here, I set maximum fitness to one. Any proportional change 
in maximum fitness is matched by the same proportional 
change in the regression coefficient, so the expression can 
be scaled arbitrarily. From this regression expression, the 
average of s z z must be less than one, otherwise average fit- 
ness drops below zero and mutational decay dominates se- 
lection, causing loss of heritable information or "mutational 
meltdown" [25] . 

Note that the regression expression E(w|z) = 1 — p wz z 
does not require a linear relation between character value and 
fitness. Rather, /3 WZ is simply the best least squares fit of 
fitness to trait value given the actual pattern by which trait 
values associate with fitness. 



Any evolutionary problem can be expressed in this way. 
But whether it is useful to do so depends on the particular 
problem and, to some extent, on one's preference between 
alternative ways to partition total change into various 
components. 

Selective improvement often pushes traits in the oppo- 
site direction from transmission decay. The balance be- 
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tween these opposing forces occurs when the total change 
is zero 

wAz = AS + At = 0, (3) 

which also means that at an equilibrium balance 

AS = -At. (4) 

This equation provides the ultimate expression of a bal- 
ance between selective improvement and transmission de- 
cay [IB]. 

We can often write the change caused by selection as 

AS = s z V z , (5) 

where s z is the selective intensity on the character z, and 
V z is the variance in the character z under selection (see 
Box [2]). If selection causes a decrease in the character, 
we would instead write 

AS = -s z V z (6) 

to express the negative contribution of selection to the 
change in character. Using these expressions for the 
change caused by selection in Eq. ^ , we obtain the equi- 
librium variance under a balance between selection and 
transmission as 



The absolute value is used because s z and V z are al- 
ways positive, whereas At may be positive or negative 
depending on whether the transmission bias increases or 
decreases the trait. The key point is that when the op- 
posing forces of selection and transmission are in balance, 
we have this simple expression for the variance of a char- 
acter. 



Box 3. What are groups? 



One must distinguish between two aspects. On the one 
hand, the fundamental theory works perfectly for essentially 
any conception of groups of alleles, individuals, or other en- 
tities. The groups do not require clear delineation, temporal 
continuity, or biologically meaningful interaction. Selection 
within groups simply means the differential success between 
entities in the group, however that differential success arises. 
Transmission bias simply means the fitness weighted change 
in character value between the entities in the group and their 
descendants. No restriction is placed on how the descendants 
themselves are arranged into groups. 

On the other hand, most potential groupings have no bi- 
ological meaning. One naturally prefers groups defined by di- 
rect interaction, temporal continuity, shared interest, and so 
on. Much literature debates alternative conceptions of mean- 
ingful groups 27-30 . Difficulty occurs because the relative 
value of alternative views varies with biological context, in- 
tellectual goal, and subjective bias about what is ultimately 
meaningful. Such undecidable alternatives attract endless de- 
bate and commentary. 

Discussion of biologically meaningful alternatives can lead 
to improved understanding as the weight of evidence accu- 
mulates for certain views. However, that discussion has often 
sought absolute conclusions, when in fact context and subjec- 
tive aspects necessarily play a role. In my view, one needs 
to keep in mind both the fundamental truth of the universal 
theory and the nuance of changing context and meaning in 
application. With both perspectives in mind, one never loses 
way. 



equal, as in the balance condition of Eq. Q. Increas- 
ingly positive values arise from greater dominance of se- 
lection bias, whereas increasingly negative values arise 
from greater dominance of transmission bias. Later ex- 
amples illustrate the application of this ratio. 



A MEASURE OF SELECTION VERSUS 
TRANSMISSION 

How exactly should we interpret Williams' phrase 
"hereditary information for which there is a . . . selection 
bias equal to several or many times its rate of endogenous 
change" ? We could evaluate the strength of selection bias 
relative to transmission bias to obtain a simple measure 
for the ratio, 1Z, between the forces. In particular, when 
the two terms oppose each other, we may write 

The negative sign appears because the opposing di- 
rections of change for AS and At mean that these 
terms have opposite signs. The negative sign makes the 
ratio positive. The logarithmic scaling imposes symme- 
try about zero. The ratio is zero when the two forces are 



MULTILEVEL SELECTION 

The individual typically comprises a group of alleles. 
In some cases, selection may occur between alleles within 
the individual. That selection within individuals creates 
a transmission bias between ancestors and descendants, 
because the sample of alleles transmitted to descendants 
is changed by selection between alleles within the ances- 
tor. The total change can be expressed as selection be- 
tween individuals plus the transmission bias created by 
selection within individuals. In this case, we can think 
of selection and transmission as the combination of two 
levels of selection [T5] . 

Now consider a population of individuals structured 
into groups. The total change may be partitioned into 
selection between groups and the transmission bias be- 
tween an ancestral group and the descendants derived 
from that group (Box [3]). Selection between individuals 
within the group will often strongly influence transmis- 
sion bias, because selection within the group changes the 
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composition of traits that are transmitted to descendants 
of that group. The total change can be expressed primar- 
ily as selection between groups and the transmission bias 
created by selection within groups. Once again, we can 
think of selection and transmission as the combination of 
two levels of selection. 



TIMESCALE 

The balance between selection and transmission de- 
pends on the rate of selection between groups relative to 
the rate of endogenous change within groups. Timescale 
influences the relative rates. 

Consider, for example, an increasing number of rounds 
of selection within groups for each round of selection be- 
tween groups. If there is some limit to the ultimate size 
of groups, then the transmission bias caused by selec- 
tion within groups increasingly dominates the selection 
between groups |31j . Similarly, an increase in the num- 
ber of rounds of replication within a lineage relative to 
the timescale of selection between lineages causes rela- 
tively greater mutation and decay during transmission 
compared with the selection bias. For example, the male 
mutation rate appears to be greater than the female mu- 
tation rate in several animal species, probably because of 
the greater number of replications per generation in the 
male germline |32j . 

It seems obvious that a relatively greater time for se- 
lection bias or transmission decay enhances the relative 
strength of a process. However, the simplicity of parti- 
tioning total change into selection and transmission in 
relation to timescale is not always developed clearly. By 
going through the examples properly, we can recover the 
simple conceptual unity that helps to explain a wide va- 
riety of biological problems. 



BALANCE BETWEEN SELECTION AND 
MUTATION 

Perhaps the most basic of all evolutionary theory con- 
cerns the balance between selection and mutation [2]. 
From Eq. 0, let the trait z = q be the frequency of 
a deleterious allele. The equilibrium balance between se- 
lection and mutation occurs when the rate at which selec- 
tion removes deleterious alleles equals the rate at which 
mutation adds new deleterious alleles. From Eq. Q, the 
balance occurs when AS = —At. From Eq. we can 
also express that balance as 



where V q is the variance in allele frequency, and s q is 
the selective intensity on allele frequency. In this case, 
mutation increases the frequency of the mutant allele, so 
At is positive and we do not need to use absolute values. 



Classic results of population genetics 

Suppose mutation changes a normal allele into a dele- 
terious allele. Once an allele has become deleterious, it 
cannot mutate back into a normal allele. Let the muta- 
tion rate of normal alleles be /i. Normal alleles occur at 
frequency 1 — q. Thus, the change in the number of mu- 
tant alleles caused by transmission bias is in proportion 
to At = fi(l — q). 

Selection reduces the reproductive success of mutant 
alleles by the selective intensity, s q = s. The variance 
in allele frequency is V q = q(l — q), the standard bino- 
mial expression for variance when sampling a single allele. 
Substituting these expressions into Eq. (|9j), the balance 
between selection and mutation occurs when the allele 
frequency is 

do) 

This result applies to haploid genetic systems, and at 
least approximately to diploid systems with dominant 
deleterious mutations under the commonly used assump- 
tions in population genetics. This expression captures 
the essential opposition between selective improvement 
and transmission decay that plays a key role in many bi- 
ological problems. For the following section, it will be 
useful to note that, from Eq. ([6]), the selection bias is 
AS = -sV q = -sq(l-q). 

To complete the classic treatment, I now write the case 
for a recessive mutation in diploid genetics. The muta- 
tion bias remains At = fi(l — q). For recessive alleles, the 
deleterious phenotype is only expressed in the homozy- 
gote, which occurs at frequency q 2 under random mating. 
Thus, selective intensity on each copy of the deleterious 
allele increases with the probability, q, that it will be 
mated with another deleterious allele, so the selective in- 
tensity is s q — sq. Substituting these expressions into 
Eq. Q yields the classic mutation-selection balance for 
recessive diploid genetics as 

For the following section, it will be useful to note that 
the selection bias against the deleterious allele is AS = 
-sqV q = sq 2 (l -q). 

Ratio of selection versus transmission 

The epigraph from Williams [T, emphasizes the relative 
strength of selection bias to transmission bias. That com- 
parison makes sense intuitively. However, when we use 
the results in this section to measure the relative strength 
of selection and transmission, the comparison turns out 
to be complex. The problem is that the relative strength 
of selection and transmission changes as evolution occurs 
in response to those forces. 
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log(s/n) = 5 



log(s/n) = 2 



log(s/n) = -1 



log(s/jx) = 5 



log(s/|i) = 2 



log(s/n) = -1 



-6 -4 -2 2 4 6 
Mutant frequency, log[q/(1-q)] 

FIG. 1. The relative dominance of selection bias versus trans- 
mission bias in models of selection and mutation. Relative 
dominance is measured by the ratio, TZ, of Eq. pj. (a) The 
diploid dominant or haploid model, (b) The diploid recessive 
model, in which AS 1 = — sq 2 (l — q) and TZ = log(sg 2 / fi) . All 
logarithms use base 10. 



For the simple models of selection and mutation in 
this section, Fig.[T]plots the relative strength of selection 
bias to transmission bias, TZ, from Eq. ffify. For example, 
Fig. [lji shows the first model with equilibrium q = fj,/s 
in Eq. (JToJ). In that case, AS = —sq(l — q) and At = 
— q), so the ratio is TZ = log(sqffi). 

The top curve of Fig.[l^ plots TZ for log(s//i) = 5. The 
plot scales the frequency of the mutant allele as log[g/ (1— 
q)]. That scaling puts the midpoint of zero at q = 0.5, 
with high and low frequencies scaling symmetrically and 
roughly logarithmically about the midpoint. 

In the top curve of Fig. [l£i, when the mutant frequency 
is not too low, selection bias is many times the trans- 



mission bias, TZ ^> 0. However, the mutant frequency 
evolves in response to the relative strength of selection 
and transmission. When selection is stronger, the mu- 
tant frequency, q, declines. As q declines, the ratio drops 
until TZ = 0, at which point the selection bias equals the 
transmission bias. Similarly, when q is very small, then 
the transmission bias is much greater than the selection 
bias, K « 0, and the mutant frequency increases until 
the point TZ — 0. 

The ratio of the selection bias to the transmission bias 
does not have a constant value. As mutant frequency 
changes, the relative dominance of the two forces shifts. 
The system comes to rest only when selection and trans- 
mission are in balance. Given the changing relation 
of selection and transmission, how should we interpret 
Williams' dictum? 

We could emphasize the example of the lower curve in 
Fig.[l}i. That curve never rises above zero, because trans- 
mission bias is always greater than selection bias for all 
frequencies. In that case, no hereditary information ac- 
cumulates. So we might say that hereditary information 
accumulates when selection bias is stronger than trans- 
mission bias for at least some conditions. But that is 
a rather weak statement, changing Williams apparently 
beautiful clarity into a muddle. 

Let us hold the point for now. As we go through vari- 
ous examples, we will see that the ratio of selection bias 
to transmission bias changes in response to key aspects 
of the particular problem under study. Rather than try- 
ing to abstract away how each particular problem shapes 
the changing ratio between selection and transmission, it 
may be more useful to use that ratio to understand each 
particular problem and the relations between different 
problems. 



Timescale 

Timescale arises implicitly in these models, because 
selection and transmission are both expressed per unit 
time. In the simplest models, one usually considers a 
single round of mutation per generation for each round 
of selection per generation. However, multiple rounds 
of mutation can occur for each round of selection. For 
example, many replications typically occur in the male 
germline of species that make large numbers of sperm. 
Those multiple replications occur for each round of se- 
lection. The multiple replications apparently increase 
the mutation rate in relation to the strength of selec- 
tion [52] • This change in the relative magnitudes of se- 
lection and mutation is important but not particularly 
profound. Later, we will see more interesting ways in 
which timescale alters the balance between selection bias 
and transmission bias. 
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VARIANCE UNDER A BALANCE BETWEEN 
MUTATION AND STABILIZING SELECTION 

Selection sometimes acts in a stabilizing way, pushing 
the average phenotype toward an intermediate optimum. 
Mutation opposes selection by spreading trait values and 
increasing the average distance from the optimum. The 
decrease in phenotypic variance caused by selection is 
opposed by the increase in variance caused by mutation. 

Here, I assume that all phenotypic variance is caused 
by simple genetics. This assumption allows me to focus 
on the processes that balance selection and mutation. I 
summarize the standard approach for this problem [51 1331 
[34] . following Frank and Slatkin [26] . 



mutation and stabilizing selection. However, V a is the 
variance of squared deviations 

V a = a 2 — a , 

and thus scales with the fourth power of deviations. Typ- 
ically, we seek expressions for the variance under stabiliz- 
ing selection rather than expressions scaled to the fourth 
power of deviations. We can, under two particular cases, 
reduce the fourth power expression to an expression for 
variance under stabilizing selection. 



Equilibrium variance 



General expressions 

Define the character of interest as 7 = z 2 , and set the 
optimum at zero, which is also the average value in this 
symmetric model. Then z 2 is the squared distance from 
the optimum, and the average of this squared distance 
is the variance. Using 7 as the character of interest, at 
mutation-selection balance, from Eqs. §f§ and ^ we have 



r. 



At 



(12) 



Suppose a mutation adds or subtracts c from the phe- 
notypic value, z. The two directions of change occur with 
equal probability. Thus, each mutation changes pheno- 
type by ±c. The contribution of each mutation to the 
change in squared deviation of phenotype, z 2 , is, on av- 
erage, c 2 . Mutations happen at a rate /i, so the change 
in the phenotypic variance caused by mutation is 

At = c 2 (i. 

The scaling c 2 translates between genetic mutations 
and phenotypic effects. We can use that same scaling 
to translate between the phenotypic scale, 7, and the 
genetic scale, a, with the relation 7 = c 2 a. Here, a is 
the squared deviation on the genetic scale, and 7 is the 
squared deviation on the phenotypic scale. The average 
of squared deviations is the variance, so we have 7 and 
a for the phenotypic and genetic variances, where the 
overbar denotes the average. 

The term V 7 is the variance of the squared phenotypic 
deviations, 7. Because a variance is itself a squared value, 
V-y summarizes the square of the squared deviations, thus 
scaled to the fourth power. Therefore, the proper relation 
to go from the phenotypic scale to the genetic scale is 
V y = c 4 V„. 



Substituting V 1 = c 4 V a and At = c/fi into Eq. (12) 



yields 



V a = ^ 
s 



(13) 



where s = c 2 s 7 . This expression for V a provides the most 
general solution for variation under a balance between 



When selection is much stronger than mutation, s 3> 
H, the general balance result of Eq. ( 13 ) is approximately 



aw-, 
s 



(14) 



where a is the variance on the genetic scale. Note that 
this result is essentially the same as the haploid mutation- 
selection balance result in Eq. ( 10 ) from the previous 
section. Box [4] provides the derivation. 

When selection is much weaker than mutation, s <C fJ., 
then 



(15) 



which matches the result for the diploid recessive model 
in Eq. (11). Here, fi = /u/2. With weak selection, most 
alleles deviate from the optimum of zero. At nonzero 
values, mutation is equally likely to move the allelic value 
closer or farther from the optimum. Thus, only one- 
half of mutations are deleterious, and fl expresses the 
deleterious mutation rate. Box [4] provides the derivation. 

Note that selection on phenotypes can be strong, yet 
the selection bias against each mutational step can be 
weak. Here, weak selection refers to the effect on each 
mutational step. In particular, I defined s = c 2 s 1 below 
If the phenotypic effect, c, of each mutation is 



Eq. (13) 

small, then strong selection on the phenotypic scale, s 7 , 
can be associated with weak selection on each mutational 
step of size c when expressed on the genetic scale, s. 



Mutation overwhelms selection 

If the decay in fitness by mutation exceeds the max- 
imum fitness that can be achieved, then mutation over- 
whelms selection. Mutation dominates selection when 
the magnitude of mutational effects is much greater than 
magnitude of selection, s <C [A, which corresponds to re- 
sults above for weak selection. 
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Box 4. Variance under mutation-selection balance 



To obtain the equilibrium genetic variance in Eq. ( 14 1 
when selection is much stronger than mutation, s»/i, note 
that 7 = c 2 a. Thus, with strong selection, most alleles will 
be at the optimum with 7 = a = 0, and a few alleles will be 
one mutational step away from the optimum at 7 = c 2 and 
a — 1 1.26]. Let the mutant frequency be q, so that a — 1 
with probability q, therefore a 2 — 1 with probability q. Thus 



a 2 = q and a 



With small q, we h ave g > g 



therefore V a 
Eq. ph. 



a, and thus Eq. ( 13 1 leads to 



To obtain the equilibrium genetic variance in Eq. ( 15 I 
when selection is much weaker than mutation, s <C \i, we 
assume that the distribution of allelic values approximately 
follows a Gaussian with a mean at zero [3] 1261 135] . With a 
Gaussian, the fourth moment is approximately three times 
the square of the second moment (variance), thus a 2 « 3a 2 
and V a — a 2 — a 2 w 2a 2 . Using this expression for V a in 



Eq. (131 yields Eq. (15 1 



From Eq. ([2| of Box [2] we can write fitness as w — 
1 — sa, using s = s a for selective intensity on the genetic 
character a. Thus, average fitness is w = 1 — sa and, 
using Eq. (Il5j) for a, we obtain w = l—Sy/ /t/s = 1 — s/Jis. 
Mutational meltdown occurs when w < 0, which implies 
(Is > 1. 

This condition simply means that the amount of dele- 
terious mutation, /t, scaled by the fitness consequence per 
mutation, s, reduces fitness by an amount that is greater 
than maximal fitness. The next section considers when 
the mutation rate might be so high. 



Eigen noted the paradox of the error threshold for early 
evolution. Without error correcting enzymes, the muta- 
tion rate was high. A high mutation rate limited the max- 
imum sequence length. A short sequence could not con- 
tain enough information to encode error correcting en- 
zymes. Without error correcting enzymes, the sequence 
remains too short to encode error correction. How did 
the biochemical machinery of error correction evolve? 

Eigen et al. (321 HQ] discussed a second interesting prop- 
erty of sequence evolution under mutation and selection. 
A population of sequences exists as a mixture of the most 
fit sequence and a variety of mutant sequences. Eigen 
called the most fit sequence the master sequence, and the 
population of sequences that are zero, one, two, or more 
mutational steps away from the master sequence the qua- 
sispecies. The term quasispecies is meant to differentiate 
a population of variants from a typological notion of a 
species as a fixed, nonvarying entity. 

The error threshold and the quasispecies are equiv- 
alent to the standard evolutionary concepts of heritable 
variation maintained by a balance between mutation and 
selection, as described in the previous section |4T| , The 
epigraph from Williams captured the key idea of the er- 
ror threshold by expressing the notion of a gene "as any 
hereditary information for which there is a . . . selection 
bias equal to several or many times its rate of endoge- 
nous change" [TJ p. 44]. The classical mutation-selection 
theory of Haldane, extended to the maintenance of vari- 
ation under stabilizing selection, expresses the concept 
of quasispecies. All of these theories have to do with the 
fundamental partition of total evolutionary change into a 
component of selection and a component of transmission 
fidelity. 



ERROR THRESHOLD AND QUASISPECIES 

Eigen applied the fundamental tension between mu- 
tation and selection to the evolution of nucleotide se- 
quences. In early evolution, the mutation rate was likely 
to be high because enzymes that correct replication er- 
rors did not yet exist. The initially high mutation rate 
and lack of error correction lead to Eigen's error thresh- 
old paradox |4ll5oU55]. 

Suppose the initial replicating sequences had a length 
of n nucleotides. If the mutation rate per nucleotide is 
/i, then the mutation rate per sequence is roughly nji. 
The deleterious effect per mutation is s. Thus, the ex- 
pected deleterious effect of mutation during each repli- 
cation of a sequence of length n is n/is. When the dele- 
terious effect per replication is greater than maximum 
fitness, here scaled to be one, mutation overwhelms se- 
lection and no selective increase in adaptation can be 
achieved. The condition for remaining below this error 
threshold is nfis < 1, which means that sequence length 
is limited to 

1 

n < — . 
/is 



MULTILEVEL ANALYSIS OF CLADE 
SELECTION 

Williams 42 argued that the relative rates of selection 
and transmission influence evolutionary change at all tax- 
onomic levels. Williams adopted the term clade selection 
from Stearns [43]. Van Valen's [5] analysis provides the 
clearest way to understand the ideas and potential im- 
portance. 

Van Valen [5] began by comparing the evolutionary his- 
tory of sexual and asexual types. He set up the problem 
by assuming that asexuals have a short-term advantage 
in growth rate relative to sexuals, and that sexuals have a 
long-term advantage with regard to avoiding extinction 
and forming new species [44j 05]. With those assump- 
tions, Van Valen (p. 87) suggests that one 

Consider a large set of species, some oblig- 
atory apomicts [asexuals] and some at least 
facultatively sexual. The apomicts will have 
a greater probability of extinction of lineages 
and the sexual species will have a greater 
probability of speciation by splitting of lin- 
eages. . . . However, apomicts will sometimes 



originate from sexual species because of their 
immediate advantage. 

Van Valen recognized two levels of selection. Cladcs 
with more sexual species will increase in species number 
relative to clades with fewer sexual species. Thus, sex 
has an advantage between clades. Within clades, asexu- 
als will arise repeatedly because of their short-term ad- 
vantage relative to their sexual ancestor. The selection 
within clades that favors asexuals can be thought of as 
a transmission bias: sexual species sometimes produce 
asexual descendants, whereas asexual species rarely pro- 
duce sexual descendants. 

Van Valen used the fact that one can express the two 
levels of selection as selection between clades and a trans- 
mission bias within clades to develop a simple model for 
the equilibrium frequency of asexuals. That equilibrium 
frequency balances selection bias between clades favor- 
ing sexuals, with rate s, against transmission bias within 
clades favoring asexuals, with rate fi, to obtain the ap- 
proximate equilibrium frequency of asexuals, q, as 

s 

This expression is the same as the standard model of 



mutation-selection balance in genetics given in Eq. ( 10 1 



In this model, Van Valen [5] emphasizes that selection 
at any taxonomic level is always potentially balanced 
against the rate of endogenous change at that level, echo- 
ing the epigraph from Williams. Endogenous change may 
arise in various ways, such as mutation by change of state 
or selection between the lower-level entities that comprise 
the higher level. 

Van Valen also applied this approach to mammals. In 
mammals, genera with larger body size survive longer 
than genera with smaller body size, but the smaller bod- 
ied genera bud off new genera at a higher rate. The net 
reproductive rate of small genera is higher, giving a selec- 
tive advantage to small bodied genera over large bodied 
genera. Within genera, there is a bias toward larger body 
size. The distribution of mammalian body size is influ- 
enced by the balance between selection between genera 
favoring smaller size and selection within genera favoring 
larger size. 

Various philosophical issues in the interpretation of 
clades as units have been taken up by Van Valen [46] . 
Williams [42] , and Okasha [12] . Here, I only applied the 
fact that one can partition the patterns of change at one 
level, such as clades, into components of selection and 
transmission. The philosophical issues focus on whether 
one can think of clades as natural units, for some reason- 
able meaning of natural. 



MULTILEVEL ANALYSIS OF KIN AND GROUP 
SELECTION 

Total evolutionary change includes a part caused by 
selection and a part caused by lack of fidelity in trans- 



mission (Eq. [IJ . In this section, I use that basic partition 
of total change to study two levels of selection, general- 
izing the model of clade selection in the prior section. 

At the higher level, a group may be any sort of col- 
lection. We may, for example, consider groups of in- 
dividuals or groups of alleles within an individual. Se- 
lection concerns differential success among groups. At 
the lower level, selection within groups causes a bias 
in the transmission fidelity of group-level characteristics 

EI[T3[T51|2S1I1Z|. 

The most interesting problems arise when selection 
among groups opposes the transmission bias caused by 
selection within groups. We may then consider a bal- 
ance between selection and transmission or, equivalcntly, 
a balance between the two levels of selection, AS = — At, 
as in Eq. Q. 

I present three aspects of multilevel selection. First, I 
write a very simple expression for the balance between 
the two levels of selection. This expression of balance 
provides the general basis for multilevel models of selec- 
tion and the analogy to the classical models of selection 
and mutation. 

Second, I apply the balance between different levels of 
selection to the tension between competition and coop- 
eration. That simple model illustrates how easily we can 
understand the basic processes of group- level cooperation 
within the broader framework of selection and transmis- 
sion. I also show the fundamental equivalence of group 
selection and kin selection models in group-structured 
populations. 

Third, I discuss the roles of population regulation and 
timescale. For population regulation, if limited space or 
resources regulates group productivity, then all groups 
may have roughly the same reproductive output. In that 
case, little selection occurs among groups, and the within 
group component of selection dominates [T51 HH1 . 
For timescale, the number of rounds of selection within 
groups relative to the rate of selection among groups sets 
the relative scaling of selection between the two levels. 
When the rate of selection within groups overwhelms the 
rate of selection among groups, then the within group 
component of selection dominates evolutionary process 

&■ 



The balance between levels of selection 

The fundamental equation for balance is AS = — At, 
the balance between selection bias and transmission bias. 
For multilevel selection, we interpret AS as the selection 
among groups and At as the transmission bias caused 
by selection within groups. For problems in which se- 
lection at the different levels pushes character values in 
opposing directions, we may rewrite the expression as 
AS a = —AS W , the balance between selection among 
groups and selection within groups. 

The change in a character caused by selection can be 
expressed as AS = sV, the product of the selective in- 
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tensity, s, and the variance in the character under selec- 
tion, V (see Box [2]). Thus, we may write the balance 
AS„ = -AS,,, as 



S a V a 



■jV w 



In a group-structured population, the total variance is 
the sum of the variance among groups and the variance 
within groups, which we express as V t = V a + V w . Making 
the substitution V w — Vt — V a yields 



s a V a = s w (V t -V a ). 



It is convenient to express the pattern of variance by the 
correlation coefficient r = V a /V t , where r measures the 
correlation in character values between individuals within 
a group. Dividing both sides by Vt yields 



,(l-r). 



(16) 



To understand this expression, we need to consider the 
interpretation of the correlation, r — V a /Vt- The corre- 
lation is the fraction of the total variance that is among 
groups. Because variance provides a weighting on se- 
lection, r can be thought of as the fraction of the total 
weighting of selection that happens at the group level, 
and 1 — r can be thought of as the fraction of the total 
weighting of selection that happens within groups. 

Thus, s a r is the intensity of selection among groups, 
s a , multiplied by the weighting of selection at the group 
level, r. At a balance, the group-level component must 
be equal and opposite to the intensity of selection within 
groups, s w , multiplied by the weighting of selection 
within groups, 1 — r. 

The correlation r is also a particular form of the regres- 
sion coefficient of relatedness from kin selection theory, 
as hinted initially by Hamilton |15l 152] following from 
the work of Price [6], and later analyzed more formally 
QUI [HI EH O [54]. The equivalence of r and Hamilton's 
formal theory of kin selection establishes the exact equiv- 
alence of multilevel group selection and kin selection. 



The tension between competition and cooperation 

We need an explicit expression for the relation between 
a trait and fitness in order to evaluate the abstract ex- 
pressions from the previous section. In this section, I 
present a simple model of competition and cooperation 

[531551. 

In a group-structured population, we can express fit- 
ness as the product of two components. The first com- 
ponent is the individual's relative share of total group 
success. The second component is the total success of 
the group. For the first component, we may write the 
individual's relative share of the group's success as z/z g , 
where z is the individual's tendency to be competitive 
against neighboring group members for access to local 
resources, and z g is the average competitive tendency in 



the individual's group. Selection within groups always 
favors greater competitive tendency, because an individ- 
ual's share of group success always rises with an increase 
in z. 

For the second component, total group success, sup- 
pose that the more intensely individuals compete against 
neighbors, the less efficient the group is in using its re- 
sources productively. For example, a certain fraction of 
local energy may go into outcompeting neighbors rather 
than enhancing productivity. We may express the neg- 
ative effect of competitiveness on group productivity by 
writing the total group productivity as 1 — z g , in which 
the total productivity declines as the group members' 
average tendency to compete, z g , rises. Thus, selection 
among groups always favors a less competitive and more 
cooperative behavioral tendency, because group success 
declines as average competitiveness, z g , rises. 

Putting the two pieces together, the fitness, w, of an 
individual with competitive tendency z, in a group with 
average competitive tendency, z g , is 
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f (1 - %) 



(17) 



To evaluate the balance between selection at the group 
level and selection within groups, we need to relate the 
expression for fitness to the particular selective tenden- 
cies and variance components in Eq. (16 1. The selective 
intensity among groups is s a = — 1, because group fitness 
is 1 — z g , and selective intensity is the change (partial 
derivative) in group fitness with change in the average 
trait value in the group. The selective intensity within 
groups is s w = (1 — z g ) I z g , which is the change in in- 
dividual fitness, w, with change in individual character 
value, z. 

Substituting these values for s a and s w into Eq. ( 16 ) 



yields a balance between group and individual selection 
when 



1 



-(1 



(18) 



Skipping over the technical details, we may say roughly 
that, in this case, selection acts in a stabilizing way, 
causing individuals trait values to converge toward a sin- 
gle value that is an evolutionarily stable strategy (ESS). 
Thus, individual values, z, converge toward group aver- 
ages, Zg, which in turn converge to a global value, z* . 
Making the substitution z g = z* and solving for z* gives 
the balance point [55, 56 as 



1 



(19) 



This balance point expresses the key insights of mul- 
tilevel selection and kin selection. In terms of multilevel 
selection, 1— r is the fraction of the total variance that oc- 
curs within groups. The greater this weighting of within- 
group selection, the higher the balancing point of z* , the 
tendency of individuals to compete with neighbors. As 
variance shifts toward the group level, 1 — r declines, z* 
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decreases because competitive restraint is more strongly 
favored, and the balance of selective forces increasingly 
favors cooperative behavior. In terms of kin selection, 
as the coefficient of relatedness, r, increases, competitive 
restraint and cooperative behavior rise. 



Population regulation and timescale 

Several factors may influence the intensity of selection 
within groups compared with the intensity of selection 
among groups [El |57] . For example, if limited space or 
resources regulates group productivity, then all groups 
may have roughly the same reproductive output. In that 
case, little selection occurs among groups, and the within 
group component of selection dominates [T5J [THl HSTfoTj 

In the model from the prior section, group productiv- 
ity was 1 — z g , and the change in group productivity with 
a change in the group phenotype, z g , was s a = — 1. Sup- 
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pose instead that the relation between group phenotype 
and group productivity is much weaker, because extrinsic 
aspects of space and resources limit group productivity. 



FIG. 2. The relative dominance of selection bias among 
groups versus transmission bias within groups in a multilevel 
selection model. Relative dominance is measured by the ra- 
For example, if group productivity is l-ez g , where e < 1, t io, TZ, of Eq. (20|. Different levels of relatedness shift the 

balance between selection bias among groups and transmis- 
sion bias within groups. Here, relatedness is measured by 
f — log [r/(l — r)]. 



then s a = — e. Using that value of s a in Eq. (18), we ob- 
tain the solution 

1 - r 



l-r(l- e) 



This solution is equivalent to Eq. (19) when e — 1. As 
limits on group productivity become more stringent, e de- 
clines toward zero, the balance tips more strongly in favor 
of selection within groups, the level of competitiveness, 
z* , increases, and, equivalently, the level of cooperation 
declines. 

There arc, of course, many complex ways in which indi- 
vidual traits may relate to group productivity and to the 
intensity of selection within groups. But all the differ- 
ent complexities tend to reduce to the simple balancing 
of forces between selection among groups versus bias in 
transmission fidelity of group characteristics or, equiv- 
alently, selection among groups versus selection within 
groups. If some additional force weakens selection among 
groups, then selection within groups increasingly domi- 
nates. Similarly, if some additional force weakens selec- 
tion within groups, then selection among groups increas- 
ingly dominates. 

Timescale provides another example. If the number 
of rounds of selection within groups increases relative to 
the pace of selection among groups, then selection within 
groups increasingly dominates the balance of forces [48l 
[55] , I discuss two particular cases in later sections on 
parasite virulence and microbial metabolism. 



Ratio of selection at different levels 

In multilevel selection, the relative strength of selective 
bias to transmission bias from Eq. pi) compares selection 



among groups to selection within groups. Substituting 
the expressions for AS and At derived from Eq. (17) 
into Eq. Q yields 



TZ = log 



1-z 



log 



1 - r 



(20) 



Fig. [2] plots TZ versus the level of competitiveness, z, for 
different levels of relatedness, r. 

The level of competitiveness is in equilibrium balance, 
z* , when the lines cross TZ = 0. At that point selection 
bias among groups is equal and opposite to transmission 
bias caused by selection within groups. Once again, we 
see that selective bias is greater than transmission bias 
only when the system is out of equilibrium. 

Following the epigraph from Williams, one may wish 
to think of groups as acquiring information, adaptation, 
or a degree of unitary function to the extent that se- 
lective bias tends to dominate transmission bias. Be- 
cause relative dominance depends on the phenotype, z, 
one interpretation would be that significant group-level 
function requires the relative dominance of selection over 
transmission across a wide range of possible phenotypes 
|30j . The range of phenotypes over which selection bias 
dominates transmission bias increases with a rise in relat- 
edness, r. Thus, one may say that increasing relatedness 
shifts the locus of information or adaptation toward the 
higher level. 

That interpretation of group-level unity or adaptation 
goes beyond what the analysis by itself presents. The 
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analysis simply describes the way in which 1Z shifts with 
competitive intensity and relatedness. The interpretation 
of group-level unity is a gloss that may aid or hinder un- 
derstanding in different contexts. Ultimately, one must 
retain a clear view of the underlying analytical basis. 



STOCHASTIC CORRECTOR MODEL OF EARLY 
PROTOCELLS 

Protocells are simple membrane-bound groups of genes 
that likely formed in early evolution [TU] . A model of pro- 
tocell evolution provides insight into group selection, kin 
selection, parasite virulence, and the evolution of sym- 
bionts [55J HH ED]- 

In the protocell model, the selective bias between cells 
opposes the transmission bias arising from selection be- 
tween genes within cells. Expanding on the epigraph 
from Williams [T], the degree to which adaptive design 
occurs at the protocell level versus the internal genie level 
depends on the selective bias between cells relative to the 
rate of endogenous change within cells. 

Each protocell can be thought of as a bag that starts 
with k pieces of genetic material (chromosomes). The 
chromosomes compete within the protocell for resources. 
Success at acquiring resources influences the rate at 
which chromosomes can replicate themselves within the 
cell. More competitive chromosomes use up local re- 
sources less efficiently and reduce the overall success of 
the protocell and its group of chromosomes. 

A protocell competes with other protocells for re- 
sources from the environment. A protocell produces 
a progeny cell after it has acquired sufficient resources 
and its chromosomes have replicated. The fitness of the 
protocell and its chromosomes depends on the rate of 
progeny production. Sampling of chromosomes occurs 
when progeny are formed: k chromosomes are chosen 
randomly from the pool of copies in the cell. I refer to 
this sampling process as segregation. 

This protocell model is a particular expression of the 
group selection model in the previous section. By study- 
ing this particular example, we can see more clearly how 
specific aspects of mutation, competition, and selection 
within groups affect transmission bias. 

Suppose that the fitness of a chromosome follows the 



expression in Eq. (17), repeated here 



w 



(1 - %) 



where z is a chromosome's tendency to be competitive 
against neighboring chromosomes for access to local re- 
sources within the protocell, and z g is the average com- 
petitive tendency of chromosomes in the protocell. Fol- 
lowing Eq. (19) of the previous section, the balance of 



selection between protocells and transmission bias within 
protocells is z* = 1 — r, where r is the kin selection co- 
efficient of relatedness among the chromosomes within a 
cell. 



Virulence and symbiosis 

The stochastic corrector model allows us to connect the 
abstract expressions from the multilevel analysis of kin 
and group selection to specific interpretations of parasite 
virulence and the evolution of symbionts within hosts 
[55l [59] , For virulence, one can think of each of the k 
chromosomes as a parasite, and one can think of the pro- 
tocell as the host. Competition between the parasites 
may cause inefficient use of host resources. Overexploita- 
tion of the host reduces host fitness. Thus, competition 
between parasites within hosts tends to increase viru- 
lence. The lower the relatedness, r, among the parasites 
within a host, the greater the competitiveness and viru- 
lence of the parasites, z* = 1 — r. 

We may also think of the k chromosomes as symbionts 
living within a host. From the host's point of view, in- 
creasing r reduces the competitiveness between the sym- 
bionts, aligning symbiont and host interests. In order to 
increase r, hosts may be favored to reduce the number, 
k, of symbionts transmitted to offspring or transmitted 
between hosts 6 lj . Hosts may also be favored to reduce 
the mixing of symbionts between different hosts [62] . 



Kin selection and group selection 

This model allows us to evaluate the meaning of the 
kin selection coefficient, r, within a particular scenario. 
Assume that transmission is purely vertical, because the 
chromosomes do not mix between cells. In this model of 
vertical transmission, three forces affect the evolution of 
competitiveness, z* = 1 — r. 

First, selection between protocells favors reduced com- 
petitiveness of chromosomes within cells, leading to 
greater efficiency at the cellular level. Against that cel- 
lular level effect, the competition and selection between 
chromosomes within cells causes a transmission bias that 
favors increased competitiveness of chromosomes. 

Second, mutations reduce the similarity among chro- 
mosomes within hosts, thus reducing r. The force im- 
posed by mutation is controlled by two parameters, the 
mutation rate, [A, and the change in character value 
caused by each mutation, S. Each mutational event 
changes z by ±<5, where the alternative directions of 
change occur with equal probability. Thus, mutation by 
itself causes no transmission bias. 

Third, segregation samples from the local chromo- 
somes when the protocell reproduces. Each new progeny 
starts with k chromosomal copies. When the cell re- 
produces, replicates of the local chromosomes are cho- 
sen stochastically according to relative fitness within the 
cell, zl z g . This sampling reduces the variance within 
hosts and increases relatedness. 

A stochastic computer simulation of this model showed 
that relatedness, r, and equilibrium trait values, z* , are 
held in balance by a delicate interaction among mutation, 
selection and segregation [55] . The observed equilibrium 
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trait values in the computer simulation closely follow the 
prediction z* = 1 — r, where r — V a /Vt is calculated 
directly from the simulation by measuring the within- 
ccll and total variances of trait values for the individual 
chromosomes in the population. The specific parameters 
affect variances and equilibrium trait values as expected: 
relatedness declines and z* rises as the mutation rate, //, 
or mutation step, 5, increases. An increase in the number 
of chromosomes per cell, k, causes an increase in com- 
petitiveness, z, because more copies reduce the variation 
among cells caused by sampling during segregation. 



Kin selection arises from patterns of variance, not 
genealogy 

The analysis in the previous section demonstrates that 
genealogy does not provide a sufficient explanation for 
the evolution of cooperative and competitive traits. The 
genealogical closeness between chromosomes in a cell in- 
creases as k declines. That genealogical aspect explains 
some of the changes in competitiveness, z* . But, for a 
fixed genealogical scheme and a fixed mutation rate, the 
magnitude of the effect of each mutation, 5, can strongly 
influence the equilibrium value, z* [551 . Larger muta- 
tional effects raise the variance within groups relative to 
the variance among groups, causing a decline in r, an 
increase in the strength of selection within cells, and an 
increase in the equilibrium competitiveness, z* . 

The theory of kin selection formulated by Hamilton 
[63] depends solely on patterns of variance and corre- 
lation, not on genealogy [T5]. Genealogy is often closely 
associated with patterns of variance and correlation. The 
simple protocell model illustrates how the association be- 
tween genealogy and the patterns of variance and corre- 
lation may break down. When the association breaks 
down, the true causal processes of variance and correla- 
tion explain the outcome. Since Hamilton's (63j work, 
no fundamentally derived theory of kin selection based 
on genealogy has existed. However, it is often convenient 
to use the fact that genealogy typically associates with 
the underlying causal processes of variance and correla- 
tion. That convenience has unfortunately confused many 
authors about the distinction between a convenient asso- 
ciation and the fundamental theory and its history. 

We may recover the association between genealogy and 
causal process if mixing of chromosomes between cells oc- 
curs. Such mixing often dominates mutation in determin- 
ing the patterns of variance within and among groups. In 
that case, genealogy may become the main force deter- 
mining r and the equilibrium level of competitiveness, 
z*. 

In conclusion, the mutation rate and the size of muta- 
tional effects primarily influence the patterns of variance 
under some conditions, whereas the migration rate and 
genealogy primarily influence the patterns of variance un- 
der other conditions. It is the patterns of variance and 
correlation that determine outcome. 



Reasons to favor kin selection over group selection 

Kin and group selection follow the same partition of 
variance within and among groups. A group selection 
analysis tends to emphasize the variance among groups 
and therefore the effect of selection at the group level. A 
kin selection analysis tends to emphasize the correlation 
between members of the same group, measured by the 
kin selection coefficient. The correlation within groups 
and the relative amount of variation among groups are 
simply alternative ways of expressing the partitioning of 
variances gS]. 

In more complicated biological problems, it often be- 
comes difficult to express all of the selective forces in 
terms of relative variances among groups. The problem 
is that patterns of interaction may differ with respect to 
different processes, such as mating, competition between 
certain individuals such as males, and competition be- 
tween other individuals such as females. In that sort of 
realistic scenario, it is far easier to trace pathways of cau- 
sation through a series of partial correlations that can be 
interpreted as an extended form of kin selection analysis 
[TBI BE]- In practice, it is rarely sensible to express such 
multiple pathways of causation by expressions of rela- 
tive amounts of variance among groups, although such 
expressions may be possible mathematically. For that 
reason, kin selection often becomes a more natural form 
of analysis for realistic biological problems, leading to a 
generalized path analysis framework. 

The present article is about the separation between se- 
lection and transmission rather than a general approach 
to pathways of causation. Frank [171 118) summarized the 
path analysis approach, although some readers may find 
those publications a bit technical. I will return to the 
path analysis methods in a later article in this series. 



SHORT-SIGHTED PARASITE EVOLUTION 

Within-group competitiveness often evolves, even 
though competitiveness reduces the equilibrium fitness 
of all individuals. The models in the previous sections 
provided examples. In those models, the favored value 



of competitiveness was given in Eq. ( 19 ) by z* = 1 — 



Competitiveness rises as relatedness between group mem- 
bers, r, declines. The equilibrium fitness from Eq. (17) 
is w — 1 — z* — r. Thus, reduced relatedness in groups 
increases competitiveness and causes a decline in fitness 
for all individuals and all groups. 

I mentioned one interpretation of this simple model in 
terms of parasite virulence. Parasites may compete for 
resources within the host. Greater competitiveness may 
lead to overexploitation of the host, harming the host and 
ultimately damaging or destroying the resource on which 
the parasites depend. In that regard, reduced relatedness 
of parasites within hosts may lead to enhanced competi- 
tion and greater virulence, where "virulence" means the 
degree of harm the parasites cause the host. 
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Levin and Bull [5] emphasized the key role of evolu- 
tionary timescale. A long period of within- host evolution, 
with many rounds of parasite competition and selection, 
may favor the origin and spread of increasing competi- 
tiveness between parasites, leading to greater virulence. 
That evolution of increasing virulence occurs during the 
time of an infection within a single host. Such evolution- 
ary increase of virulence can kill the host and, in con- 
sequence, kill the parasites themselves. In that regard, 
the newly evolved virulence is short-sighted, because it 
provides a local advantage to the parasites in the short 
run but leads to their extinction in the long run. 

If the highly virulent forms that evolve within the 
host rarely transmit to other hosts, then two distinct 
timescales exist. On the short timescale within hosts, 
high virulence repeatedly evolves but does not contribute 
to the long run evolution of the population. On the long 
timescale in the population of parasites across hosts, the 
less virulent forms transmit between hosts better than do 
the highly virulent forms, causing a moderate to low level 
of virulence among infective parasites entering a host. 

By contrast, if the highly virulent forms that evolve 
within hosts often transmit to other hosts, then the 
shorter and longer timescales interact. The short- 
term evolutionary increase of competitiveness within the 
host contributes to a transmission bias on the longer 
timescale. The contribution of the short-term increase in 
virulence within hosts to the longer timescale depends on 
the fraction of parasites transmitted between hosts that 
come from the later population within the host. The next 
section provides an example. 



DEMOGRAPHY, TIMESCALE AND MICROBIAL 
METABOLISM 

In this section, I consider groups that continuously pro- 
duce transmissible forms. The longer the time for evo- 
lution within groups, the greater the transmission bias 
toward characters favored within groups. For example, 
within-group selection often favors greater competitive- 
ness against neighbors. If many generations of selection 
occur within groups, the greater short-term pressure for 
competitiveness within groups ultimately increases the 
competitiveness across all groups. 

Microbial metabolism nicely illustrates aspects of 
timescale [5]. Extra energy devoted to resource ac- 
quisition speeds metabolic rate and competitive success 
against neighbors but reduces net efficiency and yield. 
Thus, the local benefit for rapid resource acquisition 
trades off against lower yield and reduced competitive 
success of a group against other groups [M] . 

Once again, we have a situation in which selection 
within groups favors more competitive traits, whereas se- 
lection between groups favors greater restraint and higher 
group productivity. The balance between opposing forces 
ultimately depends on the relative selective bias between 
groups compared with the transmission bias caused by 



selection within groups. 

An example 

Suppose that individual microbial colonies occur in 
separated patches. Each patch lasts for a while but even- 
tually disappears. During a patch's lifespan, there is a 
continual flow of resources available to the microbes. The 
microbes compete for the resources within the patch. 
Competition occurs by rate of resource uptake. Indi- 
viduals that invest more energy in uptake outcompete 
neighbors for resources, but their net conversion of re- 
source into reproduction is lower because they spend 
more on uptake rather than productivity. Groups that 
have highly competitive strains, devoting much energy 
to competitive increases in resource uptake, have low net 
productivity. 

Colonies continuously send out migrants in proportion 
to group productivity. Transmission bias occurs when 
the average competitive trait of migrants differs from the 
average trait value among those microbes that founded 
the colony. The local processes of competition, selec- 
tion, and production of migrants continue until colony 
extinction. Colony formation and colony extinction set 
the global birth-death process. 

The overall scenario is roughly similar to a host- 
parasite situation, in which resource patches are like 
hosts, and parasites send out transmissible progeny con- 
tinuously from an infected host. Many variations are 
possible. However, the basic setup provides a useful ex- 
pression for the interactions between colony demography 
and the different timescales of selection bias and trans- 
mission bias. 

I use the particular assumptions and results of Frank 
[9] . The interpretation follows the same type of selection- 
transmission balance of previous sections. However, the 
earlier models were often designed explicitly to illus- 
trate the partition between selective bias and transmis- 
sion bias. The value here arises from the more realistic 
biology, which forces us to parse the components of evo- 
lutionary change without the advantage of a toy model 
designed to give us a simple partition. 

No transmission bias 

Fig. [3] shows the net outcome of selective bias be- 
tween groups and transmission bias within groups. Each 
colony forms by a small group of genetically identical 
cells. When there is no mutation, as shown in the bottom 
curve, no selection can occur within the colony because 
there is no genetic variation. Thus, the bottom curve 
reflects the pure effects of selective bias between groups 
in the absence of transmission bias. The character, z, is 
the fraction of energy devoted to resource acquisition rel- 
ative to the fraction 1 — z devoted to reproduction. The 
character value at which equilibrium occurs is z* . 
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FIG. 3. The trade-off between rate and yield in microbial 
metabolism. The optimal trade-off, z* , is the fraction of avail- 
able resources invested in increasing the rate of acquiring new 
resources. The remainder of resources, 1 — 2*, enhances re- 
production. The colony survives each time period at rate S; 
the expected survival time is 1/5. Each colony begins with 
a single immigrant or small group of genetically identical im- 
migrants. The microbes use the local resources to reproduce. 
Mutations occur in the trade-off, z, between rate and yield. 
The lower curve represents no mutation in the colony. The 
middle curve has mutation rate, /i, and the upper curve has a 
higher mutation rate of lO/i. The colony sends out migrants 
to colonize new patches. The number of migrants per unit 
time for each genetic type in a patch is proportional to the 
number of cells of that genetic type. The details about rate 
processes are in Frank [S, . Redrawn from Fig. 2a of Frank [5] . 



To understand the consequences of a pure selective bias 
between groups, recall from Eq. ([I]) that the total change 
in a character is 

wAz = AS + At, 

the sum of the change caused by selective bias, AS, 
and transmission bias, At. Here, the biases are mea- 
sured with respect to microbial groups living in isolated 
patches. 

The character value settles to equilibrium when wAz — 
AS + At = 0. If there is no genetic variation among 
the initial microbes that start each colony, and no mu- 
tation, then there can be no selection within groups and 
no transmission bias, thus At = 0. With no transmis- 
sion bias, the system comes to equilibrium when AS = 0. 
Put another way, group productivity, which determines 
the selective bias between groups, AS, sets the trade-off 
between rate and yield. In the lower curve of Fig. [3] with 
no mutation, the value z* maximizes yield and leads to 
AS = sV = 0. 

In this particular model, one cannot write a simple ex- 
pression for the balance between rate and yield. Roughly, 
the idea is that a fraction z of energy is put into increas- 
ing the rate of resource acquisition, and a fraction 1 — 2 
is put into reproduction or yield. If the factors simply 



multiplied, then fitness would be w = z(l — z). The 
change in fitness with the character z gives the selec- 
tive coefficient, s. The change in fitness with character 
value is the derivative of w with respect to z, which gives 
s = 1 - 2z = 0, and so AS = sV = implies z* = 1/2. 

In the actual model, the length of colony survival af- 
fects the balance between rate and yield. Short-lived 
colonies are favored to grow quickly (high rate and low ef- 
ficiency) to use up available resources before extinction, 
whereas long-lived colonies are favored to grow slowly 
and use resources efficiently. Thus, in the lower curve of 
Fig. [3j longer colony survival causes the optimal balance 
to shift toward lower rate and higher yield. 



Balance between selection and transmission 

When mutation generates variation within colonies, 
then the rate-yield tradeoff balances selection between 
colonies and the transmission bias from selection within 
colonies. The upper two curves in Fig. [3] show the equi- 
librium balance, z* . The top curve has a mutation rate 
ten times greater than the middle curve. 

As colony survival increases, the equilibrium moves to- 
ward greater investment in resource acquisition. Higher 
resource acquisition and metabolic rate arise from the in- 
evitable production of mutant neighbors within colonies 
and the multiple rounds of internal selection within 
groups. With very long colony survival times, both up- 
per curves would converge to a high value of z* at which 
nearly all resources are devoted to resource acquisition 
and competition within colonies, with the yield efficiency 
dropping to a very low level. At that point, transmis- 
sion bias from selection within groups dominates selec- 
tion bias between groups. 

The equilibrium rate-yield tradeoff reflects the funda- 
mental balance between selection and transmission. That 
balance provides a simple conceptual basis for under- 
standing how natural selection shapes characters. How- 
ever, in this relatively realistic model, one cannot use 
the balance of Eq. Q directly to calculate the predicted 
outcome. Instead, I had to use other mathematical meth- 
ods to obtain the solution j9] . The selection-transmission 
balance only provides a framing in which to interpret the 
results. 

In the earlier models in this article, it was easy to cal- 
culate the ratio of selection to transmission, 1Z. Here, 
the calculation is difficult, and methods such as the Price 
equation, kin selection, and group selection are of no use 
in calculating the outcome. After obtaining a solution by 
other means, one can use those framings to interpret the 
forces that shaped the outcome. This limitation to post 
hoc explanations is typical of the grand theories when 
faced with realistic scenarios. Across the range of dif- 
ferent problems presented in this article, the selection 
versus transmission framing provides the most general 
conceptual view, following the spirit of the epigraph by 
Williams. 
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CONCLUSIONS 

This article is about the relative contributions of se- 
lective bias and transmission bias to overall evolutionary 
change. For any problem, we first choose a higher level 
of organization, such as a group, an individual, or a cell 
within a multicellular aggregation. Selective bias arises 
from differing success among the higher level entities. 
Transmission bias arises from changes in character val- 
ues between higher level entities and their descendants. 
Transmission biases may occur by mutation, by random 
fluctuations, and by selection within the group. 

The ratio of selective bias to transmission bias provides 
a simple measure for the relative dominance of the higher 
to the lower level of organization in overall evolutionary 
change. When the two levels oppose each other, then the 
relative dominance of one level over the other often sets 
the level at which functional coherence and individuality 
emerges. 

A key aspect of Maynard Smith and Szathmary's [TO] 
The Major Transitions in Evolution was expressed by 
Maynard Smith [SSJ pp. 229-230]: 

One can recognize in the evolution of life sev- 
eral revolutions in the way in which genetic 
information is organized. In each of these rev- 
olutions, there has been a conflict between se- 



lection at several levels. The achievement of 
individuality at the higher level has required 
that the disruptive effects of selection at the 
lower level be suppressed. 

Maynard Smith's suppression of disruptive effects at 
the lower level causes selective bias at the higher level 
to dominate. The quote and the conceptual basis of the 
major transitions therefore expresses Williams' notion of 
the ratio of selective bias to endogenous rate of change 

There is a large philosophical literature on the mean- 
ing of individuality and of units of selection in relation to 
levels of selection [HJ[S7]. One can certainly learn from 
studying that philosophical literature. However, I have 
found it more instructive to analyze a wide range of inter- 
esting biological problems, to discover in practice what 
is actually needed to understand those problems, and to 
learn what general concepts link the different problems 
within a common conceptual basis [cf . [SH [55] • Philosoph- 
ical induction from numerous evolutionary deductions. 
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